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ABSTRACT 

This paper considers the problem of compact source detection on a Gaussian back- 
ground. We rnake a one-dimensional treatment (though a generalization to two or 
more dimensions is possible). Two relevant aspects of this problem are considered: the 
design of the detector and the filtering of the data. Our detection scheme is based on 
local maxima and it takes into account not only the amplitude but also the curvature 
of the maxima. A Neyman-Pearson test is used to define the region of acceptance, 
that is given by a sufficient linear detector that is independent on the amplitude dis¬ 
tribution of the sources. We study how detection can be enhanced by means of linear 
filters with a scaling parameter and compare some of them that have been proposed 
in the literature (the Mexican Hat wavelet, the matched and the scale-adaptive fil¬ 
ters). We introduce a new hiter, that depends on two free parameters (biparametric 
scale-adaptive hiter). The value of these two parameters can be determined, given 
the a priori pdf of the amplitudes of the sources, such that the hiter optimizes the 
performance of the detector in the sense that it gives the maximum number of real 
detections once hxed the number density of spurious sources. The new hiter includes 
as particular cases the standard matched hiter and the scale-adaptive hiter. Then, 
by construction, the biparametric scale adaptive hiter outperforms these hiters. The 
combination of a detection scheme that includes information on the curvature and a 
hexible hiter that incorporates two free parameters (one of them a scaling) improves 
signiheantly the number of detections in some interesting cases. In particular, for the 
case of weak sources embedded in white noise the improvement with respect to the 
standard matched hiter is of the order of 40%. Finally, an estimation of the amplitude 
of the source (most probable value) is introduced and it is proven that such an esti- 
mator is unbiased and it has maximum efhciency. We perform numerical simulations 
to test these theoretical ideas in a practical example and conclude that the results of 
the simulations agree with the analytical ones. 

Key words: methods: analytical - methods: data analysis - techniques: image pro¬ 
cessing 


1 INTRODUCTION 

The detection of compact signals (sources) embedded in a 
bac.kground is a recurrent problem in many helds of As- 
tronomy. Some common examples are the separation of in- 
dividual stars in a crowded optical image, the identifica- 
tion of local features (lines) in noisy one-dimensional spec- 
tra or the detection of faint extragalactic objects in mi- 
crowave frequencies. The detection, identification and re¬ 
moval of the extragalactic point sources (EPS) is funda- 
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mental for the study of the Cosmic Microwave Background 
Radiation (CMB) data (Franceschini et al. 1989, Toffolatti 
et al. 1998, de Zotti et al. 1999). In particular, the contri- 
bution of EPS is expected to be very relevant at the lowest 
and highest frequency channels of the future ESA Planck 
Mission (Mandolesi et al. 1998, Puget et al. 1998). 

The heterogeneous nature of the EPS that appear in 
CMB maps as well as their unknown spatial distribution 
make difficult to separate them from the other physical com- 
ponents (CMB, Galactic dust, synchrotron, etc) by means of 
statistical component separation methods. Techniques based 
on the use of linear filters, however, are well-suited for the 
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task of detecting compact spikes on a background. Several 
techniques based on different linear filters have been pro¬ 
posed in the literature: tlie Mexican Hat Wavelet (MHW, 
Cayon et al. 2000, Vielva et al. 2001a,b, 2003), the classic 
matched filter (MF, Tegmark and Oliveira-Costa 1998), the 
Adaptive Top Hat Filter (Chiang et al 2002) and the scale- 
adaptive filter (SAF, Sanz et al. 2001). A certain deal of 
controversy has appeared about which one, if any, of the 
previous filters is optimal for the detection of point sources 
in CMB data. 

In order to answer that question it is necessary to con- 
sider first a more fundamental issue, the concept of detection 
itself. The detection process can be posed as follows: given 
an observation, the problem is to decide whet.her or not a 
certain signal was present at the input of the receiver. The 
decision is not obvious since the observation is corrupted by 
a random process that we call ‘noise’ or ‘background’. 

Formally, the decision is performed by choosing between 
two complementary hypotheses: that the observed data is 
originated by the background alone ( nuli hypothesis), and 
the hypothesis that the observation corresponds to a com- 
bination of the background and the signal. To decide, the 
detector should use all the available information in terms 
of the probabilities of botli hypotheses given the data. The 
decision device separates the space 1Z of all possible obser¬ 
vations in two disjoint subspaces, 1Z, and 1Z- , so that if an 
observation y £ 1Z- the nuli hypothesis is accepted, and if 
y £ TZ t the nuli hypothesis is rejected, that is, a source is 
‘detected’ ( 7Z * is called the region of acceptance). Hence, we 
will call any generic decision device of this type a detector. 

The simplest example of detector, and one that has been 
extensively used in Astronomy, is thresholding: if the int.en- 
sity of the field is above a given value (e.g. 5<r), a detection 
of the signal is accepted, on the contrary one assumes that 
only background is present. Thresholding has a number of 
advantages, among them the facts that it is straightforward 
and that it has a precise meaning in the case of Gaussian 
backgrounds in the sense of controlling the probability of 
spurious detections. However, it does not use all the avail¬ 
able information contained in the data to perform decisions. 
For example, the inclusion of spatial information (such as 
the curvature) could help to distinguish the sources from 
fluctuations in the background wit.h similar scale but a dif¬ 
ferent shape. 

A general detector that can use more information than 
simple thresholding is given by the Neyman-Pearson (NP) 
decision rule: 


L(xi) 


p{xi\Hi) 

p(xi\H 0 ) 


> L * 


(1) 


where L(xi) is called the likelihood ratio, p(xi\Ho) is the 
probability density function ( pdf) associated to the nuli hy¬ 
pothesis (i.e. there is no source) whereas p(xi\Hi) is the 
pdf corresponding to the alternative hypothesis (i.e. there 
is a source). Xi are a set of variables which are measured 
from the data. L » is an arbitrary constant, which defines 
the region of acceptance 7 Z*, and must be fixed using some 
criterion. For instance, one can adopt a scheme for object 
detection based on maxima. The procedure would consist 
on considering the intensity maxima of the image as candi- 
dates for compact sources and apply to each of them the NP 
rule to decide whet.her they are true or spurious. For a ID 


image, the ratio of probabilities would then correspond to 
the probability of having a maximum wit.h a given intensity 
and curvature (which are the variables Xi in this case) in 
the presence of background plus signal over the probability 
of having a maximum when only background is present. If 
this ratio is larger t.han a given value L*, the candidate is 
accepted as a detection, if not, it is rejected. 

Unfortunately, in many cases the sources are very faint 
and this makes very difhcult to detect them. In order to im- 
prove the performance of the detector, a prior processing of 
the image could be useful. Here is where filtering enters in 
scene. The role of filtering is to transform the data in such 
a way that a detector can perform better than before filter¬ 
ing. Once the detector is fixed, it. is interesting to compare 
the performance of different filters, which has been rarely 
considered in the literature. In a recent work, Barreiro et 
al. (2003) introduce a novel technique for t.lre detection of 
sources based on the study of the number density of max¬ 
ima for the case of a Gaussian background in the presence 
or absence of a source. In order to define the region of accep¬ 
tance the Neyman-Pearson decision rule is used with pdf s 
associated to the previous number densities and using the 
information of bot.li the intensities f; and the curvatures k 
of the peaks in a data set. In addition, L* is fixed by max- 
imising the significance, which is the weight.ed difference be¬ 
tween the probabilities of liaving and not having a source. In 
that work the performances of several filters (SAF, MF and 
MHW) is compared in terms of their reliability, defined as 
the ratio between the number density of true detections over 
the number density of spurious detections. They find that, 
on the basis of this quantity, the choice of the optimal filter 
depends on the statistical properties of the background. 

However, the criterion chosen to fix L* based on the 
significance does not necessarily leads to the optimal relia¬ 
bility. Therefore, if we are considering the reliability as the 
main criterion to compare filters, a different criterion for L», 
based on number densities must be used. In a posterior ar- 
ticle, Vio et al. (2004), following the previous work, adopt 
the same Neyman-Pearson decision rule, based on the pdf s 
of maxima of the background and background plus source, 
to define the region of acceptance. However, they propose 
to find L * by fixing the number density of spurious detec¬ 
tions and compare the performance of the filters based on 
the number density of true detections. In this case, the MF 
outperforms the other two filters. Note that in these last 
two works different criteria have been used to fix L*, thus 
leading to different results. 

In the present work, our goal will also be to find an 
optimal filter that gives a maximum number density of de¬ 
tections fixing a certain number density of spurious sources. 
In order to define the detector, we will use a decision rule 
based on the Neyman-Pearson test. We will consider some 
standard filters (MF, SAF and MH) introduced in the liter¬ 
ature as well as a new filter that we call the Biparametric 
Scale Adaptive Filter (BSAF). In all the filters appears in a 
natural way the scale of the source. We will modify such a 
scale introducing an extra parameter. In faet, it has been 
shown by Lopez-Caniego et. al. (2004) that the standard 
Matched Filter can be improved under certain conditions 
by filtering at a different scale from that of the source. The 
performance of the BSAF will be compared with the other 
filters. 
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The overview of this paper is as follows. In section 2, 
we introduce two useful quantities: number of maxima in a 
Gaussian background in the absence and presence of a local 
source. In section 3, we introduce the detection problem and 
define the region of acceptance. In section 4, we introduce 
an estimator of the amplitude of the source that is proven 
to be unbiased and maximum efficient. In section 5 and 6, 
we obtain different analytical and numerical results regard- 
ing weak point sources and scale-free background spectra 
and compare the performance of the new Hiter with others 
used in the literature. In section 7, we describe the simu¬ 
lations performed to test some theoretical aspects and give 
the main results and finally, in section 8, we summarize the 
conclusions and applications of this paper. Appendix A is a 
sketch to obtain a sufficient linear detector whereas we ob¬ 
tain the linear unbiased and maximum efficient estimator in 
appendix B. 



K 


Figure 1. Value of rib(u, K)/rib for p = 0.7. 


2 BACKGROUND PEAKS AND COMPACT 
SOURCES 

2.1 The background 


w(R, b) 


dx£(x)iR , b). 


( 6 ) 


Let us assume a ID background (e. g. one-dimensional 
scan on the celestial sphere or time ordered data set) rep¬ 
resented by a Gaussian random field £(x) with average 
value (£(*)) = 0 and power Spectrum P(q), q = \Q\: 
(£(Q)£*(Q')> = P(q)Sd(Q — Q '), where £(Q) is the Fourier 
transform of £(x) and Sn is the ID Dirac distribution. The 
distribution of maxima was studied by Rice (1954) in a pi- 
oneering article. The expected number density of maxima 
per intervals (x, x + dx), ( u, u + du) and (k, k + dn) is given 
by 


Ub(u, k) 


rib k 


vMi-y) 


Z + K Z -2 PISK 

2(1 — PD 


( 2 ) 


being rib the expected total number density of maxima (i.e. 
number of maxima per unit interval dx) 
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where v G (—oo, oo) and k G [0, oo) represent the normalized 
field and curvature, respectively. <r 2 is the moment of order 
2 n associated to the field. 9 c ,9 m are the coherence scale of 
the field and maxima, respectively. As an example, figure 1 
shows the values of the ratio rib(u, K,)/rib for the case p = 0.7 
(a typical value for the backgrounds we are considering). In 
this case, the expected density of maxima has a peak around 
v ~ 0.8 and k ~ 1.1, that is, most of the peaks appear at 
a relatively low threshold and curvature, and the density of 
peaks decreases quickly for extreme values of u and k. 

If the original field is linear-filtered with a circularly- 
symmetric filter 'I'(æ; R, b), dependent on 2 parameters (R 
defines a scaling whereas b defines a translation) 


T(ir; R, b) 



\x-b\ 

R 


(5) 


we define the filtered field as 


Then, the moment of order 2 n of the linearly-filtered field is 

/»OO 

al = 2 l dq q 2n P(q)r/j 2 (Rq), (7) 

Jo 

being P(q) the power Spectrum of the unfiltered field and 
rp(Rq) the Fourier transform of the circularly-symmetric lin¬ 
ear filter. 


2.2 The presence of a local source 


Now, let us consider a position x in the image where a 
Gaussian source (i.e. profile given by r(x) = exp(— æ 2 /2 R 2 ), 
where R is the beam width) is superimposed to the previous 
background. Then, the expected number density of maxima 
per intervals ( x, x + dx), {v, u + du) and (k, k + dn), given 
a source of amplitude A in such spatial interval, is given by 
(Barreiro et al. 2003) 


n(u, k,\u s ) 


rib « 

\/27r(l - p 2 ) 


_ ( u — u s)^ F K s —2p(iv — i/ s )(k, — k s ) 

e ^ , (8) 

where u G (—oo, oo) and k G [0, oo), u a = A/a o is the nor¬ 
malized amplitude of the source and k s = —At'A/cji is the 
normalized curvature of the filtered source. The last expres- 
sion can be obtained as 

ft 2 t" 

Vm'ib 

k s = u s y 3 , y s = - 

P 

Note that due to the statistical homogeneity and isotropy of 
the background, the previous equations are independent of 
the position of the source. 

We consider that the filter is normalized such that the 
amplitude of the source is the same after linear filtering: 
f dxr(x)'i/(x-,R,b) = 1 . 


-i 


= 2/ dqq 2 T{q)rp(Rq).{9) 
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3 THE DETECTION PROBLEM 


We want to choose between different filters based on detec- 
tion. To make such a decision, we will focus on the follo- 
wing two fundamental quantities: a) the number of spurious 
sources which emerge after the filtering and detection pro¬ 
cesses and b) the number of real sources detected. As we 
will see in this section, these quantities are properties of the 
Gaussian field and source that can be calculated from equa- 
tions ( 2 ) and ( 8 ). As we will see, the previous properties 
are not only related to the SNR gained in the filtering pro- 
cess but depend on the filtered momenta up to 4th-order 
(in the ID case), i.e. the amplification and the normalized 
curvature. 

Let us consider a local peak in the ID data set charac- 
terised by the normalized amplitude and curvature (u 3 ,k s ). 
Let Hq\ n.d.f. n b (u , k) represents the nuli hypothesis, i.e. the 
local number density of background maxima, and H i: n.d.f. 
n(u, k) represents the alternative hypothesis, i.e. the local 
number density of maxima when there is a compact source: 


n(u, k) = / du 3 p{u 3 )n(u, k\u 3 ). ( 10 ) 

Jo 

In the previous equation, we have introduced a priori infor¬ 
mation about the probability distribution of the sources: we 
get the number density of source detections weighting with 
the a priori probability p(u 3 ). 

To construct our detector, we will assume a Neyman- 
Pearson (NP) decision rule using number densities instead 
of probabilities: 


L(u , k) 


n (^«) > L 

n b (u,n) ~ 


( 11 ) 


where L* is a constant. The previous equation defines a re¬ 
gion in (u, k), the so-called region of acceptance TZ„- There- 
fore, the decision rule is expressed such that if the values of 
(u,k) of the candidate maximum is inside TZ, (i.e. L > L*) 
we decide that the signal is present. On the contrary, if 
L < L* we decide that the signal is absent. ^ 

It can be proved that the previous region of accep¬ 
tance 7Z* is equivalent to the sufficient linear detector (see 
Appendix A) 


v and curvature k). Moreover, this detector is independent 
of the pdf of the source amplitudes. 

Using this detector, the expected number density of 
spurious sources and of true detections are given by 

n b = dudnnb(u, k), (15) 

Jiz, 


n* 



du dn n(u, k). 


(16) 


We remark that in order to get the true number of real 
source detections such a number must be multiplied by the 
probability to have a source in a pixel in the data set. 

Note that for a fixed number density of spurious sources 
ni, the NP detector leads to the maximum number density 
of true detections n*. 

Taking into account equations (13) to (16), one can find 
n b and n* for a Gaussian background. After a straightfor- 
ward calculation, the number density of spurious sources 
found using the NP rule is given by: 


n b 


Ub 

2 


erfc 


( ip* y 7 ! - p 2 \ 

\>/ 2 (i -pys)) 


+V2My s e M ^‘erfc 



1 -pys 



,(17) 


M = 


1 — p 2 


2(1 - 2pp s + y|)' 

Similarly, the number density of detections is obtained 


n b 


1 - py s 


V^(p + yl)^T^ 


(1 — P Z )V Z 


dipl((p)[l + B(z)]e 2 i 1 -pys) 2 


(18) 


where 


VsP 


1 — p 2 


1 - py* \ 2 (p + y 2 )’ 


TZ * : p{u,k) > <p*, (13) 

where <p* is a constant and ip is given by 

(14) 

We remark that the assumed criterion for detection leads to 
a linear detector <p (i.e. linear dependence 011 the threshold 


D ,\ /— z 2 r f \ _ (1 Py s) 

B{z) = Vnze erfc(-*), p = —- 5 —, 

1 -p 2 

/»OO 

I(p) = / du s p{u s )e^ v -^ +v2 ‘\ 

Jo 


(19) 


t Note that the region defined by equation (11) is equivalent to 
the one defined by the usual Neyman-Pearson test in terms of 
probabilities 


Pi", K ) 

p b (u, k) 


> L' t 


( 12 ) 


where p b (u, re), p(u , n) are the pdf s associated to the number den¬ 
sities given by equations (2) and (10) and, in order to compare 
different filters, the constant tf must be found by fixing the num¬ 
ber density of spurious sources in the region of acceptance instead 
of the false alarm probability. 


4 THE ESTIMATION OF THE AMPLITUDE 
OF THE SOURCE 


The signal has an unknown parameter, the amplitude A, 
that has to be estimated from the data (u,i t). We shall 
assume that the most probable value of the distribution 
n(u , K\u a ) gives an estimation of the amplitude of the source 
(criterion for amplitude estimation). The result u s is given 
by the equation 


Vs + P ’ 


( 20 ) 
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where the function ip is given by equation (14). One can 
prove that the previous expression corresponds to a linear 
estimator that is unbiased and maximum efficient (minimum 
variance), i.e. 


(u a ) = v , a l s 


1 

Vs + n’ 


( 21 ) 


where () denotes average value over realizations (see 
Appendix B). 


5 ANALYTICAL RESULTS 
5.1 Filters 

We will consider as an application the detection of 
compact sources characterised by a Gaussian profile 
t(x) ~ exp(—® 2 /2 R 2 ), and Fourier transform r = 
Rexp(—(qR) 2 /2), though the extension to other profiles will 
be considered in the future. Such a profile is physically and 
astronomically interesting because it represents the convolu- 
tion of a point source (Dirac 5 distribution) with a Gaussian 
beam. 

The source profile above includes a “natural scale” R 
that characterises the source. This is a fundamental scale 
that will appear in all the filters we will consider here. By 
construction, the standard MF and SAF operate on this 
scale, as well as the canonical MHW at the scale of the 
source. However, it has been shown that changing the scale 
at which the MHW and the MF filter the image can im- 
prove its performance in terms of detection (Vielva et al. 
2001a, Lopez-Caniego et al. 2004). Following this idea, we 
will introduce another degree of freedom in all the filters 
that allows us to change their scale in a continuous way 
(similarly to the scaling of a continuous wavelet). This de¬ 
gree of freedom is obtained by multiplying the scale R by 
a new parameter a > 0. We will show that with this new 
parameter the improvement in the results is significant. 



x=qR (7-0) 


Figure 2. Different filters for the values of 7 = 0 (lower panel) 
and 7=1 (upper panel). The filters represented in all cases are: 
the SAF (solid line), MF (dotted line), MH (short dashed line), 
which are shown for a = 1, and the BSAF (long dashed line) 
which is given for (o, c) = (0.3,—0.86) for 7 = 0 and (a,c) = 
(0.4, —0.68) for 7 = 1. Note that for 7 = 1 the SAF and the MF 
coincide. 


a Gaussian profile for the source, the previous set of equa- 
tions lead to the filter 


tpSAF 


N(a)x 1 e 2X 


1 + 



x = qaR, 


m = 


1 + 7 
2 



A ee 


2 a 2 

1 + a 2 ’ 


5.1.1 The scale-adaptive filter (SAF) 

The idea of a scale-adaptive filter (or optimal pseudo- 
filter) has been recently introduced by the authors (Sanz 
et al. 2001). By introducing a circularly-symmetric filter, 
'F(.'r; R, b), we are going to express the conditions in order to 
obtain a scale-adaptive filter for the detection of the source 
s(x) at the origin taking into account the faet that the source 
is characterised by a single scale R 0 . The following condi¬ 
tions are assumed: (1) (w(R 0 ,0 )) = s(0) = A , i.e. w(R o ,0) 
is an unbiased estimator of the amplitude of the source; (2) 
the variance of w(R,b) has a minimum at the scale R 0 , i.e. 
it is an efficient estimator; (3) w(R, 0) has a maximum with 
respect to the scale at (_R o ,0). Then, the filter satisfying 
these conditions is given by (Sanz et al. 2001) 


IpSAF = 


1 


t(q) 


ac — b 2 P(q) 


b + c — (a + b) 


dlnr 


dlnq 


a = / dq— f b = 


t dr _ / , 2 1 (dr 

dqq pTq' C = dqq p[d-q 


, ( 22 ) 


Assuming a scale-free power Spectrum, P(q) cx q 7 , and 


N(a) 


a 1 

A™r (m) 1 + aa’ 

v 7 m 


(23) 


where we have modified the scale as aR. 

In this case the filter parameters 9 m , p and the curva- 
ture of the source y s are given by 


ftn = I 1 + £ + 5 
aR y (1 + m) ( 1 + £ + «£a) 


1 + A + 


1 + m 


Vo+S)(i+ 


{2 24 ( 2 + 4 )' 


= H, 


1 + b 


1 (1 + m) (l + 


, 24 ( 2 + 4 )) 
+ m‘1 ) 


H = a 1+ rn2 + t ( 1 + m ) A 

~ 1 + Aa m 2 +t(l + m) 


(24) 


Figure 2 shows the SAF for two values of the spectral 
index. 
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5.1.2 The matched filter (MF) 


If one removes condition (3) defining the SAF in the previous 
subsection, it is not difficult to find another type of filter 
after minimization of the variance (condition ( 2 )) with the 
constraint ( 1 ) 


lp MF 


1 T{q) 

2a P(q) 


(25) 


This will be called matched filter as is usual in the literature. 
Note that in general the matched and adaptive filters are 
different. 

For the case of a Gaussian profile for the source and a 
scale-free power Spectrum given by P(q) oc q 1 , the previous 
formula leads to the following modified matched filter 


4>mf = N(a)x' 1 e 2x , 


x = qaR, 


(26) 


N{a) 


a 

A m F (m) 


where m and A is given by equation (23) and we have in- 
cluded the scale parameter a. 

Figure 2 shows the MF for the case a = 1 (standard 
MF) and values of the spectral index 7 = 0,1. We remark 
that for 7=1 the scale-adaptive filter and the matched 
filter coincide, and for 7 = 2 (not shown in the figure), the 
matched filter and the Mexican Hat wavelet are equal. 

For the MF the parameters 9 C , 9 m , p and the curvature 
of the source y 3 are given by 


9m 

aR 


1 _ / m 

y/l + m ’ P V 1 + m’ 


Vs = pA 


(27) 


We remark that the linear detector <p(v, n) is reduced to 
ip = v for the standard Matched Filter (a = 1). i.e. curvature 
does not affect the region of acceptance for such a filter. 


5.1.3 The mexican hat wavelet (MH) 

The MH is defined to be proportional to the Laplacian of 
the Gaussian function: 

ipMH(y) OC (1 - y 2 )e~ v /2 . (28) 

Thus, in Fourier space 

ipMH(q) = -?=(qRf e -^ qR)2 . (29) 

Wn 


I 11 this case the filter parameters 9 m , 9 C , p and the 
curvature of the source y s are given by 


9 m 

R 


1 n _ j 2 + t " _ 3/2 

V3 + t V 3 + 1 y/(2 + t)(3 + t ) 


(30) 


The generalization of this type of wavelet for two dimensions 
has been extensively used for point source detection in 2D 
images (Cayon et al. 2000, Vielva et al. 2001, 2003). As for 
the previous filter, the MH is modified by including the scale 
parameter a in the form 


IpMH 


N(a)x 2 e 2X , x = qaR, 


(31) 


N(a) 


2a 

ØFA 3 / 2 


For the MH the parameters 9 C , 9 m , p and the curvature 
of the source y s are given by 


dm 

aR 


1 _ j 2 +t _ 3A/2 

y/3 Ft P V 3 + t’ Vs y/(2 + t)(3 + t) 


(32) 


Figure 2 shows the MH for different values of the spec¬ 
tral index. 


5.1.4 The biparametric scale adaptive filter (BSAF) 

If one removes condition (3) defining the SAF in subsection 
5.1.1 and introduces the condition that w(R 0 , b) has a spatial 
maximum in the filtered image at b = 0 , i.e. (w"(R 0 , 0 )) < 0 , 
it is not difficult to find another type of filter 

(1+c(gÆ) 2 ), (33) 

where c is an arbitrary constant that can be related to the 
curvature of the maximum. We remark that the constraint 
{w'(R 0 , 0 )) = 0 is automatically satisfied for any circularly- 
symmetric filter if the source profile has a maximum at the 
origin. 

For the case of a scale-free power spectrum, the filter is 
given by the parametrized equation 

'ipBSAF = -^j~x 1 e~ 2X (1 + cx 2 ), x = qaR. (34) 

where we have modified the scale as aR. Hereinafter, we 
will call this new filter containing two arbitrary parameters, 
q > 0 and c, the biparametric scale-adaptive filter (BSAF). 
A calculation of the different moments leads to 


9 m 

aR 


G 


7+2 


G. 


7+2 


G7+4’ x /G 7 G 7+4 ’ 


y s = 


-'7+2 


g 7 


g 7+4 ’ 


(35) 


where m and t are defined in equation (23) and G 7 and J 1 
are given by 


G 7 = i[l + 2 mc + m(m + l)c 2 ]r(m), 


(36) 


J 7 (q) = — [1 + mcA] A m F(m). 


(37) 


Note that the BSAF contains all the other considered 
filters as particular cases: the MF is recovered for c = 0, 
when c = t/m 2 the BSAF defaults to the SAF and, finally, 
the MH wavelet is obtained in the two cases: 7 = 0, c 1 
and 7 = 2 , c = 0 . 


5.2 A priori probability distributions p(iz s ) 

We will test two different pdf p(v 3 )-. a uniform distribution 
in the interval 0 < v < v c and a scale-free distribution 
with a lower and upper cut-off Vi < v < v;. I 11 particu¬ 
lar, we will especially focus 011 values for the cut-off’s that 
lead to distributions dominated by weak sources. It is in this 
regime where sophisticated detection methods are needed, 
since bright sources can be easily detected with simple tech- 
niques. 


5.2.1 Uniform distribution 
In this case, 

p(v/) = —, v s € [0, Vc\. (38) 

Vc 
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This allows us to obtain 




2 V c yjy1 + p . 


erf(ft) + erf ( -^= \Jyl + /j, - h 


h = 


\/2(y2 + aO 


(39) 


In general, we will consider a cut-off in the amplitude of 
the sources such that v c = 2 after filtering with he standard 
MF. Note that this correspond to different thresholds for 
the rest of the filters. 


5.2.2 Scale-free distribution with lower and upper cut-off 
In this case, 

p(v s ) = Nuf 0 , v G [i Vi, u f ], f3 ^ 1, (40) 

where the normalization constant N and I(tp) are 


N = 


ft- 1 1 



I(<p) = N 



duu~ f3 e , ' lip -^ {,1+Vs)2] . 


(41) 


(42) 


In general, we will consider /3 = 0.5 and u- L = 0.5, Uf = 3 
after filtering with the standard MF and the corresponding 
thresholds for the other filters. 


6 NUMERICAL RESULTS 

For a fixed number density of spurious sources ni , we want 
to find the optimal filter that produces the maximum num¬ 
ber density of true detections n* for different spectral indices 
( 7 ), values of R and point source distributions. In order to 
do this, we first obtain implicitly the value of </?* from equa- 
tion (17) (for a fixed value of ni) and then substitute it in 
equation (18) to calculate n*. We consider two different dis¬ 
tributions of sources to test the robustness of the method: 
a uniform distribution and a scale-free distribution. Given 
that bright point sources are relatively easy to detect, we 
mainly concentrate 011 the more interesting case of weak 
sources. In any case, we also mention some results for dis¬ 
tributions containing bright sources. 

We remark that the BSAF has an additional degree of 
freedom, the parameter c, as it appears in equation (34). 
Note that the BSAF and the SAF are not the same fil¬ 
ter. The parameter c in the BSAF can take any positive 
or negative value, while the coefBcient t/m 2 , for the SAF, is 
a known function of 7 . By construction, the BSAF always 
outperforms the MF and SAF or, in the worst case, defaults 
to the best of them. 


6.1 Uniform distribution 

6.1.1 Weak sources 

As a first case, we consider a uniform distribution of sources 
with amplitudes in the interval A G [0, 2]tro, where ao is 
the zero-order moment of the linearly-filtered map with the 



Figure 3. Uniform distribution. The expected number density of 
detections n* as a function of the filter parameter a for 7 = 0 for 
the BSAF (where c has been obtained by maximising the number 
of detections for each value of a), MF, SAF and MH filters. We 
consider the case I? = 3, nj = 0.05 and u c = 2. 



Figure 4. Uniform distribution. The expected number density of 
detections n* as a function of the filter parameter a for 7 = 0.5 for 
the BSAF, MF, SAF and MH filters. We consider the case R = 3, 
ni = 0.05 and u c = 2. As in the previous figure the parameter c 
of the BSAF has been determined by maximising the number of 
detections for each value of a. 


standard MF. Therefore, the threshold v in the image fil¬ 
tered with this filter is in the interval [0, 2]. Thus, the corre¬ 
sponding upper limit for v in the original (unfiltered) map is 
below 2 , what means that we are considering the detection 
of weak sources. 

As a reference example, in figure 3, we plot n*, the 
number density of detections, as a function of a for the case 
7 = 0, ni = 0.05 and R = 3, where R is given in pixel units. 
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Figure 5. Uniform distribution. The expected number density of 
detections n* as a function of the filter parameter a for 7 = 1.5 for 
the BSAF, MF, SAF and MH filters. We consider the case R = 3, 
nj* = 0.05 and u c = 2. As in the previous figure the parameter c 
of the BSAF has been determined by maximising the number of 
detections for each value of a. 

For completeness, the theoretical values of n* are given, in 
this figure, for values of a down to zero (note that n* —> 
ni when a —> 0). However, from a practical point of view, 
we do not expect the theoretical results to reproduce the 
values obtained for a pixelized image when filtering at small 
scales (since the effect of the pixel is not taken into account). 
Therefore, hereinafter, we will only consider those results 
obtained when filtering at scales larger (or of the order) of 
the pixel size, which corresponds to a ^ R~ 1 . Taking into 
account this constraint, the best results are obtained for 
a ~ 0.3 for the BSAF, that clear ly outperforms the standard 
MF (i.e., a = 1) with an improvement of the 40% in n*. If 
we compare with the MF at a = 0.3, the improvement is of 
~ 20 %. 

In figure 4, we give the same results for the case 7 = 0.5. 
In this case, the BSAF at a = 0.3 improves again signif- 
icantly the standard MF, with an increase in the number 
density of detections of ~ 25%. 

As 7 increases, the improvement of the BSAF with re- 
spect to the standard MF decreases. In faet, for values of 
1 < 7 < 2 they produce very similar results. As an example, 
we give the number of detections achieved for each filter for 
the case 7 = 1.5, ni = 0.05 and R = 3 in Fig. 5. It can 
be seen that the maximum number of detections is approx- 
imately found for the standard MF. However, we would like 
to point out that the SAF and MH wavelet at the optimal 
scale give approximately the same number of detections as 
the standard MF. These results show the importance of fil¬ 
tering at scales aR instead of the usual scale of the source 
R. 

This can also be seen in Fig. 6 , that summarizes how the 
relative performance of the considered filters with respect to 
the standard MF changes with the spectral index 7 (again 
for ni = 0.05 and R = 3). For each filter, the results are 
given for the optimal scale (and parameter c in the case of 



Figure 6. Relative difference in the number of detections, with 
respect to the standard MF, as a function of the spectral index 7 . 
Values of = 0.05 and R = 3 are used for a uniform distribution 
of weak sources ( v c = 2). For each filter, the results are shown for 
the optimal parameters. 


BSAF). The improvement of the BSAF with respect to the 
MF ranges from ~ 40% (for white noise) to zero (for the 
largest values of 7 ). We would also like to point out that the 
MH at the optimal scale performs similarly to the standard 
MF. In addition, the MH has an analytical expression which 
makes it very robust and easy to implement. Therefore, it 
can be a useful filter in some practical cases. 

We have also explored how the previous results change 
when varying ni and R. In particular, we have considered 
vaules of ni in the interval 0.01 - 0.05, R = 2 and R = 3 
and values of 7 = 0, 0.5, 1. The results are summarized in 
table 1 for the BSAF and the standard MF (we present 
only those cases where the BSAF improves at least a few 
per cent the standard MF). The values of a and c for the 
BSAF are found as the ones that maximise n* in each case. 
The relative performance of the BSAF with respect to the 
standard MF, improves when ni increases. For instance, for 
R = 3 and 7 = 0 , the improvement decreases from ~ 40% 
(for ni = 0.05) to ~ 20% (for ni = 0.01). On the other hånd, 
as R increases, the difference between the detections found 
by both filters also increases. In particular, for ni = 0.05 and 
7 = 0, the improvement goes up from ~ 21% (for R = 2) to 
~ 40% (for R = 3). 


6.1.2 Bright sources 

In order to test how the previous results are affeeted by 
the presence of bright sources, we have also considered a 
uniform distribution that contains a mixture of weak and 
bright sources with amplitudes in the interval A £ [0, 5]oo. 
In particular, we have considered the reference example with 
values 7 = 0 , ni = 0.05 and R = 3. For this case, we find 
that the optimal values of the parameters for the BSAF are 
c = —0.79 and a = 0.3. The behaviour is similar to the one 
found in the weak sources case, although the improvement 


© 0000 RAS, MNRAS 000, 000-000 






Filter design 9 


R 

n b 

7 

a 

c 

n BSAF 

n *MF 

RD[%] 

2 

0.01 

0 

0.4 

-0.69 

0.0860 

0.0824 

4.4 

2 

0.03 

0 

0.4 

-0.68 

0.1493 

0.1311 

13.9 



0.5 

0.4 

-0.59 

0.1512 

0.1474 

2.5 

2 

0.05 

0 

0.4 

-0.70 

0.1900 

0.1575 

20.6 



0.5 

0.4 

-0.59 

0.1935 

0.1783 

9.0 

3 

0.01 

0 

0.3 

-0.86 

0.0784 

0.0658 

19.1 

3 

0.03 

0 

0.3 

-0.86 

0.1282 

0.1013 

26.5 



0.5 

0.3 

-0.73 

0.1242 

0.1145 

8.4 

3 

0.05 

0 

0.3 

-0.86 

0.1654 

0.1186 

39.4 



0.5 

0.3 

-0.75 

0.1616 

0.1352 

19.5 



1 

0.4 

-0.58 

0.1582 

0.1487 

6.3 


Table 1. Uniform distribution. Number density of detections n* 
for the standard MF (o = 1) and the BSAF with optimal values 
of c and a. RD means relative difference of number densities in 
percentage: RD = 100(—1 + n *B S AF / n *M f) ■ 


of the BSAF versus the MF is lower (21% as compared to 
the previous 40%). 

It is interesting to note that for fixed values of the pa¬ 
rameters ni , 7 and R, the optimal values of a and c are very 
similar for both the weak and bright sources cases, which is 
an indication of the robustness of the technique. 


6.2 Scale-free distribution 

6.2.1 Weak sources 

For comparison purposes, we have repeated our analysis 
using a scale-free power-law distribution of sources with 
P = 0.5 and amplitudes in the interval A £ [0.5, 3]cro, where 
ao is the dispersion of the map filtered with the standard 
MF. Therefore, we are considering the case of weak sources 
since the corresponding upper limit for v in the original (un- 
filtered) maps is below 3. 

In figures 7 and 8, we plot n*, the number density of de¬ 
tections, as a function of a for the cases 7 = 0 and 7 = 0.5, 
assuming R = 3 and ni = 0.05. In figure 7, n* is signifi- 
cantly higher for the BSAF compared with the other filters 
at certain values of a. In this case, the improvement of the 
BSAF at en = 0.3 compared with the standard MF is ~ 42%. 
If we compare with the MF at a = 0.3, the improvement is 
~ 33%. In figure 8 , with 7 = 0.5, an improvement of ~ 20% 
is obtained for the BSAF at a = 0.3 with respect to the 
standard MF. 

As in the uniform distribution case, the BSAF gives 
very similar results to the MF in the range 1 < 7 < 2. In 
figure 9 we show the results for 7 = 1.5, ni = 0.05 and 
R = 3. Again we see that the optimal BSAF defaults to the 
standard MF. 

These results are summarized in Figure 10, which shows 
the relative difference in the number of detections, with re¬ 
spect to the standard MF, as a function of the spectral index 
7 for the different filters. At each point the optimal scale 
(and parameter c in the case of BSAF) has been used. We 
remark that for the interesting case of white noise more than 


MF solid 
SAF shortdash 
MH long dash 
BSAF dot - dash 



Figure 7. Scale-free distribution. The expected number density 
of detections n* as a function of the filter parameter a for 7 = 0 
for the BSAF (using the optimal values of c), MF, SAF and MH 
filters. We consider the case R = 3, = 0.05, z/j = 0.5, Vf = 3 

and (3 = 0.5. 



Figure 8. Scale-free distribution. The expected number density 
of detections n* as a function of the filter parameter a for 7 = 0.5 
for the BSAF (using the optimal values of c), MF, SAF and MH 
filters. We consider the case R = 3, = 0.05, z/j = 0.5, Vf = 3 

and (3 = 0.5. 

a 40% of detections is gained with respect to the standard 
MF. 

We have also explored how the results depend on the 
values of R and n£. In table 2, we show the number density 
of detections for the BSAF and for the standard MF (a = 1) 
for R = 2 and R = 3, with ni ranging from 0.01 to 0.05, 
and for values of 7 = 0,0.5,1 (we only include the results for 
those cases where the relative difference between the BSAF 
and standard MF is at least a few per cent). We also give 
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Figure 9. Scale-free distribution. The expected number density 
of det eet ions n* asa funetion of the filter parameter a for 7 = 1.5 
for the BSAF (using the optimal values of c), MF, S AF and MH 
filters. We consider the case R = 3, = 0.05, Vi = 0.5, Vf = 3 

and [3 = 0.5. 



Figure 10. Relative difference in the number of detections, with 
respect to the standard MF, as a funetion of the spectral index 
7. The results are shown for a scale-free distribution (/3 = 0.5, 
vi = 0.5, Uf = 3) for values of n £ = 0.05 and R = 3. 

the optimal values of c and a where the BSAF performs 
better (taking into account the constraint aR 1). As for 
the previous case of the uniform distribution, the relative 
performance of the BSAF improves when inereasing R and 
Mb¬ 
it is also interesting to consider other values of the pa¬ 
rameter (3. For instance j3 £ [2.2, 2.5] has an intrinsic inter- 
est for astronomy, because they describe the distribution of 
compact sources in the sky at microwave wavelengths. We 
have also explored the performance of the filters for these 
values (for the reference case 7 = 0 , ni = 0.05, R = 3) 


R 

n l 

7 

a 

c 

n BSAF 

77 * 

n MF 

RD[%] 

2 

0.01 

0 

0.4 

-0.66 

0.1659 

0.1590 

4.3 

2 

0.03 

0 

0.4 

-0.68 

0.2376 

0.2089 

13.7 



0.5 

0.4 

-0.56 

0.2451 

0.2432 

7.8 

2 

0.05 

0 

0.4 

-0.68 

0.2772 

0.2311 

19.9 



0.5 

0.4 

-0.57 

0.2873 

0.2705 

6.2 

3 

0.01 

0 

0.3 

-0.83 

0.1336 

0.1180 

13.2 

3 

0.03 

0 

0.3 

-0.83 

0.1975 

0.1512 

30.6 



0.5 

0.3 

-0.71 

0.1937 

0.1767 

9.6 

3 

0.05 

0 

0.3 

-0.81 

0.2335 

0.1639 

42.5 



0.5 

0.3 

-0.70 

0.2321 

0.1928 

20.4 



1 

0.3 

-0.62 

0.2271 

0.2169 

4.7 


Table 2. Scale-free distribution. Number density of detections n* 
for the standard MF(ct=l) and the BSAF with optimal values of 
c and a. RD = 100(—1 + n* BsAF / n* MF ). 


and the improvement of the BSAF (with optimal values of 
a = 0.3 and c = —0.86) versus the standard MF is still 
significant and of the order ~ 35%. 


6.2.2 Bright sources 

To test the effeet of the presence of bright sources on our 
results, we have also considered a scale-free distribution with 
A £ [0.5, 5]tro (i.e., a mixture of weak and bright sources) for 
(3 = 0.5. We find, for the reference case (7 = 0, ni = 0.05, 
R = 3), that the BSAF improves the standard MF around 
a 25%, with optimal parameters a = 0.3 and c = —0.76. 

We would like to point out that for for a given set of 
7 , R and ni, this distribution of weak and bright sources 
leads to very similar optimal parameters for the BSAF as 
the scale-free distribution of weak sources. 

In addition, we have also tested the performance of the 
filters for a scale-free distribution of bright sources with A £ 
[3,5]cto, for the same case as before (7 = 0, ni = 0.05, 
R = 3). We explore the parameter space of (c, a), looking 
for the best filter regarding detection. We find that, for this 
distribution, the optimal parameters for the BSAF are c = 0 
and a. = 1, that is, the BSAF defaults to the standard MF. 


6.3 On the robustness of the filters 

The filters considered here depend on a number of param¬ 
eters (a in the case of SAF, MF and MH and a and c in 
the case of BSAF) that must be determined in order to get 
the maximum number of detections for a fixed number of 
spurious detections. While for a given filter the region of 
acceptance is explicitly independent of the source distribu¬ 
tion, the methodology presented here for the estimation of 
the optimal filter parameters depends on some assumed pa¬ 
rameters of the source distribution (namely /3, 17 and Uf) 
and the noise power Spectrum ( 7 ). A full study of the ro¬ 
bustness of the method for all the filters is out of the scope 
of this work. However, we have considered some interesting 
cases as tests of the robustness of the method. 
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6.3.1 Robustness with respect to the assumed source 
distribution 


In order to ascertain to what extent the uncertainties in 
the (5 parameter of the source distribution affects the de¬ 
termination of the optimal filter parameters, we repeated 
our calculations using wrong assumptions on its value. An 
interesting case corresponds to assume that the source dis¬ 
tribution is uniform when, in reality, it is scale-free and vice 
versa. In order to do this, we first construct the BSAF using 
the optimal ( a , c) values that were obtained for the uni¬ 
form distribution. We calculate then the number density of 
sources n* obtained from a map that contains sources that 
follow a scale-free distribution with /3 = 0.5. We find that 
the differences in the number of detections when using the 
wrong filter with respect to using the optimal one are very 
small (lower than 0.1%). The same happens if the filter is 
constructed assuming an underlying scale-free distribution 
and applied to a map with sources uniformly distributed. 
This is not surprising, since tables 1 and 2 show that both 
uniform and scale-free distributions lead to similar values of 
the optimal a and c parameters. 

Another source of uncertainty that appears in any real 
case is the value of the lirniting cut-offs of the source distri¬ 
bution, Vi and Vf. We have seen in the previous subsections 
that for a given set of values ni, 7 and R , different cut-offs 
for the same distribution lead to similar optimal a and c 
parameters. For instance, in our reference example (7 = 0, 
ni = 0.05, R = 3) and a uniform distribution of sources with 
Vi = 0 and Vf = 2 the optimal filter parameters are a = 0.3 
and c = —0.86, whereas if the upper cut-off value is Vf = 5, 
the parameters take the values a = 0.3 and c = —0.79. Then, 
the shape of the optimal filter is only weakly dependent on 
the value of the cut-offs. This suggests that the method- 
ology presented here is robust against uncertainties in the 
prior knowledge of the cut-offs of the distribution. 

I 11 order to test this idea we proceeded in an analogous 
way to the case of the (3 parameter explained before: we 
apply wrong filters (that is, filters whose parameters have 
been determined assuming wrong values of the cutoffs) to 
test cases with real distributions of sources. We tested sev- 
eral interesting cases: for uniform distributions, we studied 
the case when the sources are assumed to be weak but in 
reality some of them are bright (in our example, Vf is as¬ 
sumed to be 2 but in reality its true value is Vf = 5) and the 
opposite situation. For the scale-free distribution, we stud¬ 
ied the effect of mistaking the lower cut-off value (assuming 
Vi = 0.5 instead of its true value Vi = 3 and vice versa). For 
all the cases, we plotted the curves n» versus a. We observe 
that using a wrong filter changes the number of detections 
of all the filters, but the qualitative behaviour of the n* - 
a curves does not change. The relative behaviour of the fil¬ 
ters is basically the same, and therefore the conclusions we 
obtained in the previous sections are still valid. 

Thus, we conclude that the uncertainty in the knowl¬ 
edge of the source distribution is, in general, not a critical 
issue in the cases we have considered. 


0.15 



0 0.5 1 1.5 


Figure 11. Filter performances when the 7 parameter of the 
background is poorly known. A uniform distribution is considered. 
The true value of the background power Spectrum index is 7/, = 
0.5 whereas the filters have been constructed with a wrong 7,,, = 
0.6. Note how the figure compares with figure 4. 

6.3.2 Robustness with respect to the assumed power 
Spectrum 

A more dclicate issue is the one related to the assumption 
of the 7 parameter. If one assumes a value for 7 that is very 
different from the true one, the shape of the filters changes 
dramatically (except in the case of the MH whose shape is 
independent of 7 ) with respect to the optimal ones and this 
may lead to wrong results. However, note that there are very 
well established techniques to estimate the power Spectrum. 
Albeit in this academic case we consider power law-type 
backgrounds, it is straightforward to apply the method to 
any kind of power Spectrum that can be present in the data. 

As an example, we consider a case where the back¬ 
ground corresponds to a true value 7 1 = 0.5 whereas the 
filters have been constructed with a wrong 7 w = 0 . 6 , that 
is, a 20% error in the determination of 7 . The resulting a—n * 
curves are given in figure 11. The behaviour of the BSAF, 
the MF and the SAF is qualitatively similar to the case 
where the noise power Spectrum is perfectly known, but the 
performance of the three filters is poorer. The MH curve is 
identical to the ideal case since the shape of the filter does 
not depend on 7 . The BSAF still outperforms all the other 
filters, although the improvement in the number of detec¬ 
tions with respect to the MF slightly decreases. 


7 SIMULATIONS 

I 11 order to see how our theoretical framework works in a 
practical example, we run a large set of simulations and 
study the performance of the NP detector after filtering 
them with the different filters considered in the previous sec¬ 
tions. We choose as a practical example the interesting case 
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of a Gaussian background characterised by a white noise 
power Spectrum (7=0) and sources whose intensity distri¬ 
bution is uniform. We will focus on the detection of weak 
sources. For the sake of simplicity, we give the results only 
for the BSAF and the MF since the other two filters (SAF 
and MH) perform worse in the considered case. 

7.1 The simulations 

The different simulations are performed as follows. The im¬ 
ages contain a number N = 4096 pixels, whicli is sufficiently 
large so that the addition of a single source does not modify 
significantly the dispersion of the images. The background is 
generated as a random held with dispersion (before Hitering) 
a 0 unf = 1 (in arbitrary units). 

The sources that we have considered for this exam- 
ple have a characteristic scale R = 3 pixels. Since we are 
interested in the detection of weak sources, we add point 
sources with a uniform amplitude distribution in the interval 
A £ [0,0.86] in the same arbitrary units of the background. 
The images filtered with the standard MF (a = 1) for this 
scale (R = 3) have dispersion ao — 0.43. Thus, the sources 
are distributed in the interval v £ [0, 2], where v = A/ao is 
the normalized amplitude of the sources with respect to the 
dispersion of the field filtered with the standard MF. 

7.2 Empirical NP criterion 

For every maximum in a given image, it is possible to apply 
an empirical NP criterion to decide whether the maximum 
corresponds or not to a source. The quantities in equations 
(13) and (14) can be obtained from simulations in the follo- 
wing way: 

7.2.1 Momenta, amplitude and curvature 

The momenta <to, <ti and 02 (and, therefore, the quantities p, 
y s needed to know the value of the linear detector ip) can be 
straightforwardly calculated from the image. For every max¬ 
imum in the image, it is possible as well to measure directly 
its amplitude A and curvature k. The normalized curvature 
is easily obtained by Fourier transforming the image, mul- 
tiplying by q 2 and going back to real space. This gives the 
value of — at each point and k is obtained dividing by < 72 . 

7.2.2 Critical value <p* 

The critical value ip* that defines the acceptance region us- 
ing equation (13) can be obtained as well directly from the 
simulations. For each considered filter, it is in principle possi¬ 
ble to calculate <p* semi-empirically, inverting equation (17) 
(with the empirically obtained values of p and y a ) just as we 
did in the previous sections, and hence to proceed with the 
NP decision rule. Instead, since we are dealing with simula¬ 
tions, we will follow a fully empirical approach. 

The argument goes as follows. We fix the number den¬ 
sity of spurious detections, i.e., the number of maxima of 
the background that are misidentified as “sources” by our 
detection criterion. Then we simulate a set of images con- 
taining only background and filter them with the filter under 


study. We focus 011 the background maxima and try to de- 
termine the value of ip* that makes the NP rule to produce 
the specified number of spurious detections. For examplc let 
us consider that we perform Ntot = 50000 noise realisations 
and focus on what happens in a certain pixel (we choose 
the central pixel of the simulation in order to avoid border 
effects). For every realisation, we check if there is a maxi¬ 
mum at this position or not. If a maximum is present, the 
value of Lp is calculated. All the values of p obtained in that 
way are sorted into descending numbers (large to small). 
The value of <p» is then given by the <p corresponding to the 
r-th element (r = ni Ntot) of the sorted list (that is, <p* is 
the value of ip so that there are ni Ntot background maxima 
with 1 p > <p»). For this example, we considered ni = 0.05 
and therefore ni Ntot = 2500. 

7.2.3 Number of detections 

Once the value of <p* has been empirically determined we 
add a source with a Gaussian profile of dispersion R — 3 
pixels at the central position (pixel Al/2 + 1) of the unfiltered 
background image and then we filter it with the considered 
filter. We proceed to apply the detector to any maxima lo- 
cated at the pixel we are considering. Finally, n*, for each of 
the filters, will be the number of sources with an estimated 
ip > ip, divided by the total number of realizations Ntot- 

7.3 Results 

We have performed numerical simulations for our reference 
case (ni = 0.05, R — 3 and 7 = 0 ), assuming a uniform dis¬ 
tribution with v c = 2. We have compared the performance 
of the filters and the empirical NP detector in the simulated 
images with the theoretical predictions as a function of the 
parameter a. For each a value and each filter, we have done 
five sets of simulations of the background, each one with a 
number of realizations large enough to have 5000 of them 
containing a maximum in the central pixel^. We used these 
simulations to obtain ip * as explained before. 

In the figure 12 we present the results from the simu¬ 
lations for this case and the comparison with the theoreti¬ 
cal calculations. The lines in this plot show the theoretical 
results for each filter, the triangles the result from the sim¬ 
ulations for the BSAF and the squares the results for the 
MF. We concentrate on the BSAF, which corresponds to 
the dot-dash line. As we mentioned in previous sections, the 
BSAF significantly improves the standard MF for 7 = 0 . 
The simulations follow the theoretical results well. In the 
region where a ~ 0.3, there is a small deviation from the- 
ory which we believe is related to the faet that we are close 
to the scale of the pixel, but still, significantly close to the 
expected theoretical value. 

7.4 The estimation of the amplitude of the source 

We can estimate the amplitude of a source using the unbia- 
sed and maximum efficient estimator from equation ( 20 ) and 

i The total number of realizations in each set of simulations 
needed to obtain 5000 of them with a maximum in the central 
pixel was ~ 50000 — 60000. 
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Figure 12. Number density of detections from theory and simu¬ 
lations versus the filter parameter a for the BSAF and MF. The 
solid and dot-dash lines represent the theoretical number density 
n* and the squares and triangles are the results from the simula¬ 
tions for u c = 2, nj* = 0.05, R = 3 and 7 = 0, for the MF and the 
BSAF respectively. 


parameters have been chosen at each case. The points and 
the error bars are calculated as the average and the disper¬ 
sion of the detected sources that fali in each of the amplitude 
bins from a total of ~ 10000 detected sources. We find a sim- 
ilar positive bias in the determination of the amplitude for 
the BSAF (a = 0.3, c = —0.86) and MF (a = 0.3). How- 
ever, the error bars corresponding to the BSAF are slightly 
smaller than those of the MF. 

In the right panels, we give the results for a uniform 
distribution of sources with A £ [0,5]oo. The simulation 
parameters are the same as before (ni = 0.05, 7 = 0 and 
R = 3). As before, the points and the error bars are cal¬ 
culated as the average and the dispersion of the detected 
sources that fail in each of the amplitude bins from a to¬ 
tal of ~ 20000 detected sources. We find that the BSAF 
(q = 0.3, c = —0.79) is unbiased for bright sources whereas 
the estimation of the amplitude in the case of the standard 
MF (a = 1) shows some bias even for bright sources. There- 
fore, the BSAF with a = 0.3 outperforms the standard MF 
in both the detection and estimation for this distribution. 

The faet that sources with small amplitudes are sig- 
nihcantly affeeted by a positive bias can be explained tak- 
ing into account that these sources are more easily detected 
if they lie over a positive contribution of the background. 
This contributes systematically to the overestimation of the 
amplitude. We would like to point out that this estimator 
produces appreciably better results than a naive estimation 
using direetly the measured values at the maxima. 
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Figure 13. Real versus estimated amplitudes of simulated 
sources for the BSAF (top panels) and the standard MF (bottom 
panels). Two cases have been considered: left panels show the re¬ 
sults for sources uniformly distributed in the interval A £ [0, 2] tro 
whereas right panels show the results for sources distributed in 
A £ [0, 5]fjQ. For all four cases, the parameters of the simulations 
are = 0.05, 7 = 0 and R = 3 pixels. 


then compare it with the amplitude that we have randomly 
generated. In figure 13, we plot the real amplitude versus 
the estimated one for the BSAF (top panels) and the MF 
(bottom panels). 

Two amplitude regimes have been explored. In the left 
panels, we have simulated a uniform distribution of weak 
sources, A £ [0,2]oo- The parameters used for these simu¬ 
lations are ni = 0.05, 7 = 0 and R = 3. The optimal Hiter 


8 CONCLUSIONS 

Nowadays, the detection of compact sources on a back¬ 
ground is a relevant problem in many Helds of science. 
A number of detection techniques use linear filters and 
thresholding-based detectors. Our approach to the problem 
of detector design is different. We use a Neyman-Pearson 
rule that takes into account a priori information of the dis¬ 
tribution of sources and the number density of maxima to 
define the region of acceptance. 

In our case, we take advantage not only of the amplifi- 
cation but also of the spatial information: the curvature of 
the background is different from that of the sources, and we 
use this to improve our detection rule. 

The background is modclled by a homogeneous and 
isotropic Gaussian random Held, characterized by a scale- 
free power Spectrum P(q) oc q \ 7 > 0. 

We design a new filter that we call BSAF in such a way 
that the use of our improved detection rule based on ampli- 
fication and curvature on the filtered field will inerease the 
number of detections for a fixed number of spurious sources. 
We generalize the funetional form of this filter, as well as 
other standard filters, and introduce another degree of free- 
dom, a, that allows us to filter at any scale, including that 
of the source R. We have shown the benefits of filtering at 
scales smaller than R, which significantly improves the num¬ 
ber of detections. 

As an example, we have considered two different dis¬ 
tributions of sources. A uniform distribution in the interval 
v £ [ 0 , 2 ] and a scale-free power law distribution in the in¬ 
terval v £ [0.5, 3] (where the threshold v corresponds to the 
field filtered with the standard MF), i.e. we are consider- 
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ing weak sources. The BSAF has proven to be significantly 
better than the standard MF, the SAF and MH wavelet in 
certain cases. In particular, we have considered a reference 
case with parameters 7 = 0, ni = 0.05 and 7? = 3, where 
the improvement in the number of detections of the BSAF 
at a = 0.3 with respect to the standard MF is ~ 40%. We 
have also tested the performance of the filters for a mixture 
of weak, intermediate and bright sources. For a uniform dis¬ 
tribution with v £ [0, 5] and for a scale-free distribution 
with v £ [0.5, 5], the BSAF also improves the MF. However, 
for a scale-free distribution with v £ [3,5], i.e., dominated 
by bright sources, we find that the optimal BSAF defaults 
to the standard MF, which gives the maximum number of 
detections in this case. 

We find that the BSAF gives in any case the best per¬ 
formance among the considered filters. Indeed, the SAF and 
the MF are particular cases of the BSAF and the strategy 
we follow, i.e. maximization of the detections, guarantees 
that the parameters of the BSAF will default to the best 
possible of these filters in each case. In addition, we also 
find that the BSAF performs at least as well as the MH in 
all the considered cases. Therefore, the number density of 
detections obtained with the BSAF will be at least equal to 
the best of the other three filters, and in certain cases supe- 
rior. However, in some other cases, the gain is small and it 
is justified to use an analitically simpler filter. Our results 
suggest that for power law spectra, from the practical point 
of view, one could use the BSAF when 0 ^ 7 ^ 1 since, in 
this range, clearly improves the number of detections with 
respect to the other filters. However, for 7 <; 1.0 the usage of 
the MH is justified due to its robustness (since it has an ana- 
lytical form) and it gives approximately the same number 
of detections obtained either with the BSAF or MF. 

For all the studied cases of source distributions (except 
for the one dominated by bright sources) and fixing the val- 
ues of 7 , ni and R, we find that the optimal parameters 
of the BSAF are only weakly dependent 011 the distribution 
of the sources. We have done some simple tests in order 
to study the robustness of the method when the knowledge 
about the source pdf or the background spectral index is not 
perfect. We find that the values of the optimal filter param¬ 
eters vary slightly when we assume that the source distri¬ 
bution is uniform when, in reality, it is scale-free and vice 
versa. The uncertainties in the cut-off values of the source 
pdf affect the number of detections, but in a similar way for 
all the filters, and therefore the relative behaviour of the 
filters do not change. Errors in the estimation of the spec¬ 
tral index 7 reduces the efectiveness of the BSAF, but it 
still outperforms the other filters. All of this indicates that 
our detection scheme is robust against uncertainties in the 
knowledge of the distribution of the sources and spectral 
index. 

To test the validity of our results in a practical example, 
we have tested our ideas with simulations for the uniform 
distribution (using our reference case ni = 0.05, R = 3, 
7 = 0 ) and find that the results follow approximately the 
expected theoretical values. 

Regarding source estimation, we propose a linear esti- 
mator which is unbiased and of maximum efUciency, that we 
have also tested with simulations. 

The ideas presented in this paper can be generalized: 
application to other profiles (e.g. multiquadrics, exponen- 


tial) and non-Gaussian backgrounds is physically and as- 
tronomically interesting. The extension to include several 
images (multi-frequency) is relevant. The generalization to 
two-dimensional data sets (flat maps and the sphere) and 
nD images is also very interesting. Finally the application 
of our method to other helds is without any doubt. We are 
currently doing research in some of these topics. 
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APPENDIX A: 

The ratio L(v,k\ v a ) = n(v, k\v 3 )/rib{v, k) can be explicitly 
written as 

L(v,k\u s ) = (Al) 

and taking into account the NP criterion for detection, we 
find 

/»OO 

L(i/,«) = / dv s p(v s )L(v, k\u s ) > L*, (A2) 

Jo 

where L* is a constant. By differentiating the previous equa- 
tion with respect to <p 

^=j du s p(u s )u s e^-i^ +y >" > 0. (A3) 

Therefore, L(i/, k) > L* is equivalent to <^? > </?*, where is 
a constant, i.e. <p(v,k) given by equation (14) is a sufficient 
linear detector. 


APPENDIX B: 

Let us assume a linear estimator combination of the nor- 
malized amplitude v and normalized curvature k with the 
constraint 

v 3 = Av + Bk. (Bl) 

If the estimator is unbiased, i.e. (v s ) = v 3 , taking into ac¬ 
count that (v) = v s and (/c) = v 3 y s , we obtain the constraint 

A + By a = 1. (B2) 

On the other hånd, the variance is given by 

a 2 0s = A 2 + B 2 + 2 pAB, (B3) 

where we have taken into account that a„ = a^ = 1, (vk) = 
p + y s v 3 . By minimizing the previous expression with the 
constraint (B2), one obtains 


1 1 -py s 1 y 3 - p 

ys+p 1 — p 2 yl + p 1 - p 2 ’ 

(B4) 

Therefore, one obtains: 


Vs = -o—-, 

Vs + P 

(B5) 

2 _ 1 
^ “ yl+p ' 

(B6) 
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